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Abstract. There has been a growing interest in fully integrating Phys- 
ical Unclonable Function (PUF) for cryptographic primitives, or keyless 
encryption. Keyless primitives do not store key information during the 
entire encryption and decryption phase, providing full security against 
volatile and non-volatile memory attacks. The concept of keyless encryp- 
tion using ReRAM PUF is a relatively new concept, and the security 
aspect of the protocol has not been tested yet. In this paper, we use 
statistical models to analyze the randomness of the protocol and its re- 
sistance against frequency attacks. 


Keywords: Keyless Encryption, Physically Unclonable Functions, ReRAM, 
Binomial Distribution, Leave-one-out cross-validation 


1 Introduction 


Classical cryptography uses keys for encryption and decryption. Key generation, 
key exchange, and key storage are complex problems. Hackers try to exploit the 
key to attack a system. Attacks based on differential power analysis can extract 
cryptographic keys during the encryption and decryption process. Quantum ad- 
versaries are targeting key extractions as well [1]. Internet of things (IoT) devices 
are being used more and more in everyday life. It has become an inevitable part 
of human life, which can also threaten one’s privacy since it might contain infor- 
mation in the communication that a third party should not be aware of. Hence, 
securing IoT devices is critical [2]. 

Another drawback of using keys for cryptography, however, is the large mem- 
ory space allocated for key storage in devices. Most of the IoT devices do not 
possess ample memory space due to cost constraints and limited power supply 
[3-5]. This means that the problem of longer secret keys and strong cryptogra- 
phy systems can be hard to implement in IoT devices. These obstacles were a 
motivation to develop a keyless cryptography scheme. 

Kumar [6] and Chandrasekaran [7] proposed a keyless authentication pro- 
tocol, which uses a challenge-response pair to define the success or failure of 


2 Dina Ghanai et al. 


authentication. A keyless encryption scheme, which uses memristors technology, 
is developed by Bertrand Cambou et. al [3]. This protocol uses a memristor 
PUF (see table 1) to create ciphertext without generating a key. In this paper, 
we explore the security of this approach. 


This article is structured into five sections. Section 2 presents a brief back- 
ground information about the keyless protocol that we used and descriptions of 
the tools such as PUF, memristors, and hash functions for the Random number 
generator (RNG), and hash functions. 


Section 3 presents the main objectives of the work and the security threats. 
We worry about frequency analysis attacks; when encrypting the same plain- 
text multiple times, if we visit the same cells repeatedly, the hacker will use 
the frequency analysis technique to detect the plaintext, and the security will 
be compromised. So, this study tries to show if hackers can exploit frequency 
analysis against the keyless encryption scheme using the RERAM PUF protocol. 
In addition, section 3 presents our methods of collecting data and how the data 
was labeled in the experiment. 


Section 4 presents a statistical model that was used to test the probability of 
visiting each cell in the RERAM PUF. We want to know how many times each 
cell was visited to indicate success or failure. Our goal was to determine whether 
specific cells were more or less likely to be visited on average, which could be 
exploited by hackers. 


Section 5 presents the results of the statistical analysis. We provide tables, 
graphs, and results from a comparison among models. 


Table 1. Abbreviations 


PUF Physical Unclonable Functions 

SHA Secure Hash Algorithms 
Message-Digest 

mare (output of the hash function) 

ReRAM Resistive Random-Access Memory 

IoT Internet of things 

SMD Short message-digest 

LMD Longer message-digest 

RNG Random Number Generator 

TRN True random number 

PRN Pseudo-random number 

NIST National Institute of Standards and Technology 

LOO-IC |Leave one out information criterion 
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Fig.1. The architecture schemes for keyless encryption with memristor PUF 
(ReRAM). The handshake means that a True Random Number share among sender 
and receiver before starting encryption and decryption. The ciphertext will be the 
outcome of reading the ReRAM PUF [3]. 


2 Background 


A memristor is a PATA of a memory and a transistor to generate a new 
type of memory [8-10]. Resistive Random-Access Memory (ReRAM) works by 
changing the resistance across a die [11]. It is often referred to as a memristor. 
In 2017, [12] mentioned that “the injection of low currents in cells of memristor 
arrays, can result in dissolvable conductive paths of vari able resistances, and 
can be exploited to design Physical Unclonable Functions (PUFs).” 

Articles [3] and [13] describe how a memristor PUF is used to encrypt plain- 
text without using an encryption key. Through exploiting the features of mem- 
ristor, the output of memristors becomes the cipher. After subjecting the current 
to a specific cell, the output will be the value of resistance. As shown in [3] and 
[13] , resistance values with the current value can make ciphertext. Repeatedly 
visiting the same cell in the memristor can help hackers exploit frequency anal- 
ysis attacks to extract the plaintext. According to [3], the ReRAM cells have 
been used to generate a ciphertext. This paper is going to focus on studying the 
randomness of visiting each cell in the memristor. 

A keyless protocol based on ReRAM PUF uses three main tools: Random 
Number Generator (RNG), Hash function, and ReRAM PUF as shown in 1. 
These tools depend on each other to build the keyless protocol. So, this section 
will give a brief description of the tools before explaining the keyless protocol 
steps [1]. 
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2.1 Tools 


To run the protocol, several tools, including Random Number Generator (RNG), 
Hash function, and memristor PUF, are being used. In this section, we describe 
each tool and how it was used. 


Random Number Generator (RNG) The random number is the number 
that is generated randomly from one of the random number generators. The aim 
of the RNG is to get a unique number without repetition. There are two types 
of RNG, which are the true random number (TRN) and pseudo-random number 
(PRN). The TRN can be generated from the physical processes whereas the PRN 
can be generated mathematically. In this study, we selected TRN because it is 
recommended by the National Institute of Standards and Technology (NIST). 
We did not need to test the RNG’s randomness because it already passes the 
NIST’s tests [14]. 


Hash Function A hash function is defined as a one-way encryption which 
means that reversing the output of the hash will not help to retrieve the input. 
The hash function accepts arbitrary input and gives fixed output. The hash 
function has several common algorithms such as MD5, SHA-1, SHA-2, NTLM, 
and LANMAN [14] [15]. According to [3] and [13], the SHA-3-512 was selected 
to be used in the protocol implementation, which is one of the hash functions 
recommended by the NIST. 


Memristor PUF Physically Unclonable Functions (PUFs) are equivalent 
to human biometrics for a physical device. PUFs are hard to clone, hard to pre- 
dict, and difficult to replicate; but, they have a repeatable behavior. PUF targets 
the nanoscale device parametric variations to create unclonable measurements 
of physical objects. Their ability to generate and store secret information makes 
them a good candidate for security systems [16, 17]. PUFs generate challenge and 
response pairs (CRPs); the challenge is created during the enrollment, whereas 
the response is generated each time authen- tications are needed. The authentica- 
tion occurs when the number of mismatches between the challenge and response 
is low enough [18,19]. Memory structures SRAM, DRAM ReRAM, and MRAM 
are considered as good elements to generate PUFs [18, 20, ?]. 


The keyless protocol that we studied used a memristor PUF, which has a lot 
of active research, such as in the field of artificial intelligence (AI); the memristor 
is used to design artificial neurons or bioelectricity. The idea is that each cell 
should become a programmable node with its resistance adjusted to memorize 
learning patterns. Another field of active memristor research is to design resistive 
random-access memory (ReRAM), to replace mainstream mem ories, DRAM, 
and Flash [9, 21, 22]. Moreover, it has been mentioned that the manufacturing 
of the ReRAMs creates natural variations amongst the cells; these variations 
can be exploited to design a PUF. Therefore, the challenge/response from the 
ReRAM PUF is the resistance value obtained at a particular cell address, where 
a specific current is injected [3, 23-25]. 
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2.2 Keyless Encryption 


As shown in 1, there are several steps that need to be taken to find the address 
to be read. The sender side will combine the random number and the password 
using the exclusive or (XOR). The outcome will be a binary stream of 64 bytes. 
Then the output of XOR will be used as the input of a hash function. The 
outcome of the hash function is 512 bits long according to [3] and [13]. But 
we can say that, according to the type of the hash function, we can know the 
size of the outcome. So, we are going to call the outcome of the hash function 
512 bits long a short message digest (SMD). After getting the SMD, the sender 
side will try to extend it because the length of the message digest decides how 
many characters can be encrypted. To do this, the first n bits of SMD will be 
rotated; each time, the output rotation will feed the Hash function to obtain a 
new SMD; and finally, all SMDs will be combined to get a longer message digest 
(LMD). For encryption, the message digest will be divided into n blocks. Each 
block contains the address (7 bits) and current (3bits). The decimal value of the 
7 bits will vary from 0 to 127, and the decimal value of 3 bits will vary from 0 
to 7. The LMD will be generated from rotating the first 16 bit to get 16 MDs, 
which is equal to 8196 bits. From there, the LMD are going to split to several 
blocks; if the size of each block equal 17 bits, then we can get approximately 
8192bits/17 ~ 480 blocks. Each block works with 2 characters, so the length of 
the plaintext should be no more than 240 characters, unless we rotate more than 
16 bits from the first MD. This protocol can be extended to a longer plaintext. 
Table 2 shows show how much n rotation can produce the size of blocks and 
also the length of plaintext we can accept.in this study, we selected 16 rotations, 
which means the LMD will be 8192 bits; this leads us to obtain 480 blocks. 

The addressable table that we are referring to is a ReRAM PUF. The size 
of the ReRAM data table is 128 by 8. The address defines which row, and the 
r current defines which column to visit. The outcomes of the PUF will be used 
as ciphertext to send over to the receiver. The encryption and decryption steps 
and how the keyless protocol works have been explained in more detail at [3]. 
However, in this study, we want to test the randomness of visiting the RERAM 
PUF cell. It has been used for software protocol implementation of the TRNG 
and hash function that have been recommended by NIST, but it is still not 
certain if the PUF cells are visited equally. [3] mentions that if we encrypt with 
a new random number each time, we should visit different cells in the RERAM 
PUF. So, this paper tries to infer the probability of visiting each cell in the PUF 
with a new random number each time. 


Table 2. Number of rotations needed to generate LMD based on the length of the 
plain text. 





Maximum length of plain text|# of blocks|# of rotation|Length of MD (bit) 
240 480 16 8192 
481 962 32 16384 
963 1926 64 32768 
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3 Objectives 


The issue with this protocol occurs when encrypting the exact plaintext multiple 
times. If we visit the same cells repeatedly, a hacker will use the frequency 
analysis technique to reveal the plaintext, and the security will be compromised. 
So, this study tries to determine if a frequency analysis attack can be exploited 
against a keyless encryption scheme using the ReRAM PUF protocol. In the 
following, we will briefly describe the frequency analysis concept. 

Frequency Analysis Frequency analysis is studying the frequency of each 
letter in each language. For example, in English, “e” is the most frequently 
used letter. Each language has its own proportions of appearance. Language 
characters are slightly different from each other. Therefore, texts written in each 
language have certain common properties, which allow them to be distinguished 
from texts written in other languages. Each language has some popular sequences 
of letters; for example, in English, there are often used vowels such as e, 0, a, 
or the consonant t. Additionally, there are some very rare letters, for example, 
z or x. According to [14], the weakness of the block cipher is that the plaintext 
symbol usually assigns to the same ciphertext symbol, which means that the 
statistical features of plaintext are maintained in the ciphertext. Therefore, each 
time we need to encrypt, the protocol generates a new RN that leads to a different 
MD from hash functions, so we can claim that with each new RN, the protocol 
should visit different cells in the ReRAM PUF. In other words, the probability 
of visiting the cells in ReRAM PUF should be equal. 


4 Methods 


To accomplish this study, we ran a simulation using the encryption protocol, 
and we used statistical models to determine whether cells in the RERAM PUF 
were disproportionately visited. This will help us understand whether hackers 
can exploit frequency analysis in this protocol. Knowing what the data looks 
like helps to figure out which is the best statistical model that should be chosen. 
This section will explain the method of data collection and data analysis. In 
addition, this section will show which statistical model was used. 


4.1 Data 


We conducted an experiment in which we calculated the number of visits for 
each ReRAM cell with different current values. We used the same dataset that 
has been used in [3] and [13]. The data is the output of reading the 128 cells with 
eight different currents. The PUF size that we have used is 128 cell x 8 current 
= 1024 cells, as shown in table 3. The value of each cell is the resistance value. 
We collected data each time we encrypted and decrypted because we counted 
each cell visited in ReRAM PUF. Table 3 shows some cells were visited a few 
times, while others were not visited at all. Table 3 shows an example for only 
visiting the ReRAM PUF cell for one run of encrypting and decrypting. 
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Table 3. Example of raw data for each run. Each cell has 8 different currents (I). The 
number shown in each cell is number of visits. 
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As mentioned earlier, this study would test the randomness of visiting each 
cell. According to [3, 13] the longest acceptable plaintext can be 240 characters. 
Therefore, we loop the same plaintext several runs 10, 100, and 1000 runs. For 
each run, we count how many times each cell in ReRAM PUF is visited as 
illustrated in table 4 column 3. We frame the data as the number of times a 
specific cell is visited in a particular run of experiment, as illustrated in table 4. 


Table 4. Data structure 












































Number of times 

Run Index|Cell Index the cell is visited 
1 1 1 
1 2 1 
1 3 0 
1 4 2 
2 1004 1 
2 1005 0 
991 2 
992 1 
993 0 











Each cell was visited a maximum of 7 times, but was much more likely to 
visited less than 3 times. See figure 2. 
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Fig. 2. The data distribution 


4.2 Statistical Experiment 


To examine the randomness of the protocol, we conducted a regression to esti- 
mate the probability of visiting each cell in the ReRAM PUF. Specifically, we 
conducted a generalized linear mixed effects regression using a binomial error 
distribution. In this case, our goal was to determine the global average proba- 
bility of visiting any specific cell, and to use random effects to determine if any 
particular cells had average visit probabilities that deviated significantly from 
the global average. In other words, if a cell had a random effect significantly dif- 
ferent from zero, this would indicate a significantly higher or lower probability of 
visiting that cell compared to the average cell. Such a finding would provide evi- 
dence that the protocol is not sufficiently random to prevent frequency analysis. 
The model structure is as follows: 


Yer ™ Binomial(n, Par) (1) 


— Yc,r represents the number of visits for each cell c in a particular run of the 
experiment r. 

— n is the number of trials for each run, which is 480 for a 240 character 
message. 
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Table 5. Three model comparison. First model with two random effects. Second model 
with one random effect. Third model with no random effect. 

















Estimated looic E , 
with standard error ete ear 
Model with 
two random effects PRSE i i 
Moderate 18346 +152 0.5 | 1.2 
one random effect 
Model with 18345 +153 15 | 12 
no random effect 

















— Pe, is the probability of visiting each cell for each run. 


We used a logit link with the following linear expression: 


logit( Per) =G@+ e+ Nr (2) 


ā is the global average probability of visiting each cell. ņe is a random effect 
of cell c, the estimation of which helps us understand if we are more or less likely 
to visit a particular cell compared to the global average. 77, is a random effect of 
the experimental run. We included this random effect to account for any non- 
independence of the data from a particular run of the experiment. By definition, 
these random effects follow the assumption of being normally distributed: 


ne ~ Normal(0, ce) (3) 


nr ~ Normal(0, or) (4) 


We used two methods to evaluate if any cells deviated from the global aver- 
age. First, we used model comparison to test whether the addition of the cell and 
run random effects were parsimonious. Specifically, we used the leave one out in- 
formation criterion (LOO-IC) [26] to evaluate model performance between three 
competing models. The first model included both random effects, while second 
model included only the cell random effect, and the third model included no ran- 
dom effects (only the global average probability of visiting a cell) Table 5. If the 
model comparisons show that the simplest model (3) is the most parsimonious, 
then we have evidence that particular cells do not strongly deviate from the 
global average. However, because there are so many cells, this analysis may not 
be sufficiently robust to find a small number of cells with significant deviations. 
Therefore, our second, complementary method was to evaluate whether the 95% 
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credible interval of each random cell effect overlapped zero. If the 95% credible 
interval of a cell’s random effect overlaps zero, then the probability of visiting 
that particular cell does not deviate significantly from the global average. We 
also tested a more conservative 80% credible interval. 

We conducted this regression analysis using a Bayesian framework in the 
open-source statistical programming language, Stan [27], via the R package 
rstanarm [28-30], and we used the loo [31] package to estimate the LOO-IC 
of the competing models. 
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Fig. 3. Model with random cell effect. The error bar shows that none of the 95 percent 
credible intervals significantly deviate from 0. We can infer that there is no cell that 
has a big deviation. The highest value of cell deviation is almost 0.005 which is a 
insignificant value; In other words, it means that all probabilities for cells to be visited 
are close enough to the probability of the global mean. 


5 Results 


Multiple lines of evidence suggest that cells do not significantly vary in the 
average probability of visit, providing evidence that the protocol is sufficiently 
random. First, comparing the LOO-IC values of our statistical models showed 
that the random effects of cell and run did not add significant information to 
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the model (Table 5). In other words, model comparison did not show that the 
probability of visiting specific cells deviated significantly from the global average. 
More convincingly, none of the random cell effects had estimates that deviated 
significantly from zero based on the 95% or 80% credible intervals (see Figure 
3 for a subset of these estimates). This shows that the model was not able to 
detect a single cell with significant deviation. The global probability of visiting 
any specific cell was of course very low. The global intercept @ median and 95% 
credible interval on the logit scale was -6.9 which is equal to 0.001006771 on the 
probability scale. 

The results of this study therefore provide evidence that visiting cells is 
sufficiently random. It is possible that if we ran the experiment millions of times, 
that we could detect much smaller deviations in the probability of visiting each 
cell, but we do not suspect that such small deviations would benefit a hacker. 
We therefore claim that a hacker will not be able to use a frequency analysis 
attack to break the ciphertext. In addition, our study further suggests that it 
is secure to use the ReRAM PUF for keyless encryption. This might be one big 
step to enhance the security of IoT. 


6 Conclusion 


Keyless protocol based on ReRAM PUF shows it is sufficiently secure during 
encryption and decryption. Using PUFs has obstacles that need to be considered. 
One of these obstacles is visiting the cells; studying the randomness of visiting the 
cells and exploring the probability of visiting the cells indicates the ReRAM PUF 
helps eliminate the risk of using frequency analysis attacks against the protocol. 
In general, visiting the same cell several times during the encryption phase makes 
the frequency analysis attack work against the keyless encryption protocol based 
on ReRAM PUF. In this study, we ran a statistical model to test the probability 
of visiting the cells; the binomial model is applied in this study since we aim 
to identify the success or failure of each cell. The outcome of the statistical 
experiment illustrates that visiting cells is entirely random. Therefore, using 
a frequency analysis attack is not effective against keyless encryption protocol 
based on a ReRAM PUF. 
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